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ABSTRACT The three-dimensional structure of the 146- 
residue form of human basic fibroblast growth factor (bFGF), 
expressed as a recombinant protein in yeast, has been deter- 
mined by x-ray crystallography to a resolution of 1.8 A. bFGF 
is composed entirely of /2-sheet structure, comprising a three- 
fold repeat of a four-stranded antiparallel 0-meander. The 
topology of bFGF is identical to that of interleukin 1/3, showing 
that although the two proteins share only 10% sequence 
identity, bFGF, interleukin 1, and their homologs comprise a 
family of structurally related mitogenic factors. Analysis of the 
three-dimensional structure in light of functional studies of 
bFGF suggests that the receptor binding site and the positively 
charged heparin binding site correspond to adjacent but sep- 
arate loci on the /J- barrel. 



Basic fibroblast growth factor (bFGF) is the primary inducer 
of mesoderm formation in embryogenesis, modulating both 
cell proliferation and differentiation in vitro and in vivo (1,2). 
As a mitogen and chemoattractant, bFGF is a potent medi- 
ator of wound repair, angiogenesis, and neural outgrowth. 
These functions are mediated by interaction with high- 
affinity cell surface receptors (3, 4) and subsequent alter- 
ations in gene expression within responsive cells. Of broad 
tissue distribution, bFGF induces proliferation and migration 
of fibroblasts, endothelial cells, and astroglial cells. 

bFGF is one of at least seven proteins belonging to the 
heparin-binding growth factor (HBGF) family (5-7), which 
includes acidic FGF (aFGF). The two FGFs differ in isoelec- 
tric point (bFGF, pi = 9.6; aFGF, pi = 5-6; ref. 5) but share 
55% sequence identity and bind to the same receptors (8). At 
least two oncogenes, int-2 and hst/ks (from stomach tumors 
and Kaposi sarcoma) are also members of the HBGF family. 
Multiple forms of active bFGF, generated by amino-terminal 
proteolysis, have been isolated (9). Here we report the refined 
three-dimensional structure of recombinant 146-residue bFGF 
(10). The structure establishes bFGF as a member of the 
interleukin 1 (IL-1) family and provides a basis for interpre- 
tation of functional studies of bFGF with respect to its 
interaction with high-affinity receptors and with heparin. 

Expression and Structure Determination 

The synthesis in yeast of an acetylated 154-amino acid human 
FGF has been described (11). A truncated form, bFGF-(l- 
146) (10), corresponding to residues 9-154 of the full-length 
sequence was expressed as an intracellular product in yeast. 
We chemically synthesized the oligodeoxynucleotides 5'- 
CATGCCAGCCCTGCCGGAGGMCGGGGGCAGCGG-3' 
and 5'-CGCCGCTGCCCCCGTCCTCCGGCAGGGCTGG- 
3'. These complementary oligonucleotides were ligated to an 
Nar ISal I fragment encoding amino acids 11-146 of the 
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synthetic hFGF gene (11). The derivatized gene was cloned 
into the vector pBSlOO (12), which contains the alcohol 
dehydrogenase 2/glyceraldehyde-3-phosphate dehydroge- 
nase (ADH2/GAPDH) hybrid promoter. For expression in 
yeast, a BarnRlSal I fragment that contained the promoter- 
gene fusion was cloned into the yeast expression vector 
pBS24 (13). Yeast cells transformed with pBS24bFGF( 1-146) 
were propagated and induced for expression as described (12, 
13) and the recombinant bFGF was purified by heparin- 
Sepharose chromatography (11). 

Crystals of bFGF used for x-ray data collection were grown 
at room temperature by vapor diffusion. Macroseeded 10-//1 
sitting drops containing 0.3 mg of bFGF (30 mg/ml), 200 mM 
sodium succinate (pH 5.3), 60 mM dithiothreitol, and 40% 
saturated ammonium sulfate were deposited on silanized spot 
plates and equilibrated in a sealed chamber against a reservoir 
containing 60%. saturated ammonium sulfate- in the same 
buffer. bFGF crystals (space group P2{1{Iy, cell constants a = 
42.1 A, b = 86.6 A, c = 31.9 A) contain one molecule per 
asymmetric unit and diffract beyond 1.8-A resolution. 

Diffraction data were measured at room temperature with 
dual multiwire area detectors of Xuong-Hamlin design (14) 
with graphite monochromated CuKa x-rays generated by a 
Rigaku RU-200 rotating-anode source (operating at 50 kV, 
108 mA). Data were collected in 0.1° frames about the <o axis 
of an eulerian goniostat at a rate of 45 sec per frame. Data 
collection, reduction, merging, and scaling of symmetry- 
related intensities were accomplished with software by How- 
ard et aL (15). Merging statistics are summarized in Table 1. 

Experimental crystallographic phases determined by mul- 
tiple heavy-atom isomorphous replacement (MIR) were 
based primarily on two derivatives, prepared by soaking 
crystals in solutions containing KAu(CN)2, which binds to a 
single site, and EtHgCl, which binds to four sites. Isomor- 
phous and anomalous differences from both of these deriv- 
atives were used in phasing. A third derivative, obtained with 
K 2 PtCl 4 , was only weakly substituted. All heavy-atom phas- 
ing and refinement calculations were performed with the 
protein package of computer programs (16). Difference 
Fourier |F(native) - /^derivative)) maps, computed with 
single isomorphous difference phases from the gold deriva- 
tive, were used to refer all heavy-atom sites to a common 
origin. The correct hand of the map was chosen by inspection 
of /''(native) electron-density maps computed with both enan- 
tiomorphs of the heavy-atom constellation. Heavy-atom re- 
finement statistics are also reported in Table 1. Heavy-atom 
positions are given in Table 2. 



Abbreviations: FGF, fibroblast growth factor; aFGF, acidic FGF; 

bFGF, basic FGF; HBGF, heparin-binding growth factor; IL-1, 

interleukin 1; MIR, multiple isomorphous replacement. 
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^The atomic coordinates have been deposited in the Protein Data 

Bank, Chemistry Department, Brook haven National Laboratory, 

Upton, NY 11973 (reference 2 FGF). 
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Table 1. Data collection and heavy-auxn refinement statistics 





Native 


EtHgCl 


KAu(CN) 2 


K 2 PtCl 4 


Soaking cone, mM 




2.5 


10 


1 


Soaking time, hr 




72 


120 


12 


Resolution, A 


1.8 


2,3 


2.3 


2.9 


Measured 










reflections 


46,630 


42,074 


14,792 


20,264 


Unique 










reflections 


11,001 


9,681 


7,818 


4,923 


Emerge 


4.56 


7.79 


5.16 


4.27 


Completeness 


0.974 


0.969 


0.769 


0.999 


&F/F 




0.266 


0.323 


0.165 


^Cullia 




0.59 


0.71 


0.87 


FH/residual 




2.14 


1.31 


0.57 



Each data set was collected from a single crystal. The heavy-atom 
derivatives were refined with their anomalous data. The overall 
figure of merit was 0.65 to 2.3 A. The merging R factor is defined as 
Emerge =* [2*2: n (/ - </»/£*„</>] x 100%, where N is the number of 
unique measurements, n is the number of multiple measurements of 
a particular reflection; / is the measured and </> the mean intensity 
of a reflection. AF/F is the percent change between native (FO and 
scaled derivative (F 2 ) data (2||Fi| - |Fj|D/2|Fi|). /fonii. = 2(|F dcr - 
^php/SflFder - Fnatl), where Fnat, F der and F ph are the native, 
derivative, and calculated (phased) structure factors, respectively, 
and the summation is taken over the centric reflections. F H / residual 
is the average value of the heavy-atom contribution divided by the 
lack-of-closure error. 

An Fob S electron-density map computed at 2.5-A resolution 
with figure-of-merit-weighted MIR phases clearly revealed 
the path of the polypeptide chain comprising residues 18-143 
of bFGF (10). An initial model was developed using virtual 
atoms computed from a ridgeline representation of the elec- 
tron density by the program bones (17). An atomic model 
was obtained by fitting the C a model to fragments of poly- 
peptide chain derived from a data base of highly refined 
protein structures, using the dgnl option of fro do (18). 

The model was refined in three cycles (incorporat- 
ing crystallographic data in the resolution shells 8-2.5 A, 
6-2.0 A, and 6-1.8 A) of molecular dynamics-associated 
crystallographic refinement as implemented in the program 
x-plor (19). After each cycle, the model was refit to a 2F D b S 
~ ^"caic n^P computed with refined phases. A typical x-plor 
cycle consisted of 40 cycles of Powell minimization of 
stereochemical and x-ray terms, followed by molecular dy- 
namics simulation in which the molecule was allowed to cool 
slowly from an initial temperature of 2000 K (cycle 1), 1500 
K (cycle 2), or 1000 K (cycle 3) to a final temperature of 300 
K, using time steps of 0.5 fsec, with 25 K reductions in the 
temperature of the heat bath every 25 fsec. Restrained 
refinement of individual atomic temperature factors was 
carried out after the final cycle. 

The model at the present state of refinement comprises 
1019 protein atoms corresponding to residues 18-143 (10) and 
two ordered sulfate ions but no solvent water molecules. The 
root-mean- square (rms) deviation from ideal covalent bond 
distances and bond angles is 0.017 A and 2.9°, respectively, 
for the refined atomic model. The crystallographic R factor 
is 0.23 for the data extending from 6.0 A to 1.8 A. A volume 
of the 2F' obs — F'caic electron-density map computed with 
calculated phases from the present model is shown in Fig. 1. 
Solvent accessibility calculations were performed using a 
program written by T. Richmond (Yale University), follow- 
ing the method of Lee and Richards (20). 

Structure and Homology to IL-1/3 

bFGF is composed entirely of antiparallel /3-strands, each of 
which is hydrogen-bonded to /3-strands adjacent in the pri- 
mary sequence. The continuous 0-meander forms a barrel 
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Heavy-atorn^TCitions 



Derivative 


Relative • 
occupancy 


X 


y 


z 


EtHgCl 


4.S 


0.3336 


0.2314 


0.9097 




3.9 


U.JOUJ 


0.2729 


0.0521 




2.3 


0.3584 


0.2721 


0.0128 




0.7 


0.2812 


0.2388 


0.1440 


KAu(CN) 2 


7.6 


0.2634 


0.2308 


0.0536 


K 2 PtCU 


2.8 


0.3790 


0.2058 


0.3075 




1.9 


0.2898 


0.0919 


0.3478 



closed by the amino- and carboxyl-terminal strands (Fig. 2). 
The tertiary fold can be decomposed into three copies of a 
four-stranded /8-meander motif, corresponding to residues 
18-59 (repeat A), 60-100 (B), and 101-143 (C). The motifs are 
related by an axis of threefold symmetry, generating a closed 
barrel. The structural motifs, shown in Fig. 3 Left, are 
remarkably similar to each other, repeats A and B being 
particularly so. 

The first 17 amino-terminal residues of the molecule, rich 
in serine, glycine, and proline residues (Fig. 4) are not visible 
in the electron-density map and are apparently disordered. 
These residues would extend from the "top" of the barrel as 
depicted in Fig. 2. The carboxyl-terminal 3 residues are also 
disordered. The core of the bFGF molecule inaccessible to 
solvent is packed with hydrophobic and aromatic amino acid 
side chains, while the surface of the molecule is rich in 
charged amino acids, particularly arginine and lysine, as 
might be expected from the unusually high pi of the molecule. 

The tertiary structure of bFGF is topologically identical to 
that of IL-1/3 [refs. 23 and 24; entry 4I1B in the Protein Data 
Bank (22), submitted by B. Verrapandian, T. L. Poulos, 
G. L. Gilliland, R. Raag, L. A. Svensson, E. L. Winborne, 
Y. Masui, and Y. Hirai, April 15, 1990] as expected on the 
basis of the limited primary sequence similarity of these 




Fig. 1. A volume of the 1.8-A-resoIution 2F 0 ba ~ ^caic electron- 
density map computed with phases derived from the refined coor- 
dinates. Ser-113, Trp-114, and Tyr-115 are in the foreground, with 
Met-76, Arg-107, and Thr-112 in the background. The contour level 
of the map is 1.65 standard deviations above the mean. 
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Fig. 2. Ribbon diagram illustrating the tertiary and secondary 
structure of bFGF. 0-Sheet secondary structure is depicted with 
flattened ribbons. The polypeptide segment (residues 105-115) im- 
plicated in receptor recognition is shown in red (see text). The 
diagram was produced using the computer program ribbons by M. 
Carson (University of Alabama, Birmingham). 

proteins (25, 26). The 96 residues comprising the /S-cores of 
bFGF and IL-10 can be superimposed with a rms deviation 
between corresponding C a positions of 1.55 A. 

The structural superposition, performed both by inspec- 
tion and automatically using the program O (21), leads to the 
primary sequence alignment presented in Fig. 4 and allows 
the definition of "structure conserved sequences" (27, 28). 
The peptides contained within these sequences adopt similar 
</> and tp peptide torsion angles and can be superimposed with 
rms deviations of < 1.0 A. This alignment differs significantly 
from that of Gimenez-Gallego et aL (25). 
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Favorable structural alignments can also be obtained be- 
tween IL-1/8 and circular permutations of the repeat stucture 
of bFGF. Rotation of bFGF such that repeats CAB are 
aligned in order with IL-10 ABC gives 109 C a atoms super- 
imposed with rms deviations of 1.98 A; alignment of bFGF 
BCA with IL-1/3 yields 101 C° atoms superimposed withurms 
deviations of 1.71 A. However, neither of these alignments 
generate as many primary sequence matches or as optimal an 
overlap as does direct superposition. 

bFGF contains four cysteine residues, of which two, 
residues 25 and 92, are conserved in the HBGF family (2). 
Cys-87 and -92 are positioned on opposite ends of a six- 
residue turn. None of the four cysteines are involved in 
disulfide bridges. While the S Y atoms of Cys-87 and -92 are 
only 7.3 A apart, the formation of a disulfide bond between 
them would require a substantial conformational change 
within the loop. Cys-25 and -69 occupy nearly identical 
positions in the primary sequence with respect to the under- 
lying topological repeat of the molecule (Fig. 3 Left), 

bFGF forms oligomers, particularly dimers, under nonre- 
ducing conditions. Dimerization possibly occurs between 
Cys-69 and -87, which are accessible to solvent in the crystal 
structure and which can be modified by carboxymethylation 
(29). In contrast, Cys-25 and -92 are totally inaccessible to 
solvent in the three-dimensional structure. The formation of 
intermolecular disulfide bridges involving residues 25 and 92 
seems unlikely on steric grounds. All four cysteine residues 
can be replaced by serine residues without loss of biological 
activity (30, 31), although replacement of Cys-25 and -92 
reduces the affinity of bFGF for heparin (30), suggesting that 
minor structural perturbations may result from the substitu- 
tion of these buried cysteines. 

Location of Potential Receptor and Heparin Binding Sites 

Residues 106-115 may form part of the bFGF binding site for 
the high-affinity receptors (32). These residues form a dis- 
torted antiparallel /3-turn on the surface of the molecule (Figs. 
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Fic. 3. {Left) Superposition of bFGF residues 18-58 in blue (repeat A), 60-100 in red (repeat B), and 101-143 in yellow. Forty-two C a atoms 
in repeat A can be superimposed onto corresponding positions in repeat B with rms deviation of 1.24 A; 37 atoms in repeat A can be superimposed 
with those in repeat C with rms deviation of 1.34 A. The side-chain atoms of the four cysteine residues in bFGF are shown in green and labeled 
(SG, for S"0. Superposition calculations were performed using the program O (21). (Right) Optimal spatial superposition of the intact C a 
structures of bFGF (blue) and IL-10 (yellow). The alignment superimposes 96 atoms with rms deviation of 1.55 A. # 
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20 (7) 



40 (26) 



bFGF . . . .KDPKRLYCKNGGFFLRIHPDGRVDGVREKSDPHI. . . KLQLQAE 
I Lib APVRSLNCTLRDS .<X2KSLVMSGPYELKALHLQGQDMEQQWFSMSFV 



60 (55) 80 C78) 

I I 

bFGF ER GWSIKGVCANRYLAMK . . ED . GRLLASKCVT D 

ILlb QGEE SND KI P VALG LKEKNLYLSC VLKDDKP TLQLE S VD P KN YP KKKMEK 



100 (106) 120 (126) 

I I 
bFGF ECFFFERLESNNYNTYRSRKYTSWYVALKRTGQYKLGSFTGPGQK . AILF 
ILlb RFVFNKIEI . NNKLEFESAQFPNWYISTSQAENMPVFLGGTKGGQD ITDF 



143 (150) 

bFGF LPMS 
ILlb TMQFVS 

Fig. 4. Alignment of the primary sequences of bFGF and human 
IL-1/3 (IL-lb). The C a coordinates of IL-1/3 were obtained from entry 
1I1B from the Protein Data Bank (22) and superimposed with 
corresponding positions in bFGF. The C a positions of corresponding 
residues shown in boldface are separated by <1 A on superposition 
of the polypeptide subsequences in which they are contained. The 
alignment is identical to that illustrated in Fig. 3 Right. 

2 and 5). Tyr-114 and Trp-115, implicated as strong binding 
determinants by peptide competition experiments with syn- 
thetic peptides (A. Baird, personal communication), expose 
80 A 2 of solvent-accessible surface (20) (versus the «260 A 2 
expected if the dipeptide were fully exposed). The latter two 
residues are conserved in the primary sequence of IL-1/3. 
Further, the section of polypeptide chain in which Tyr-114 
and Trp-115 are contained forms the longest block of con- 
secutive residues in bFGF that can be superimposed with a 
corresponding sequence in IL-1/3 (Fig. 4). FGF and the IL-1 
isoforms may thus present similar and corresponding binding 
surfaces to the extracellular domains of their receptors, both 
of which belong to the immunoglobulin superfamily (3). 

Phosphorylation of bFGF at Thr-112 by protein kinase A 
increases the affinity of the growth factor for the FGF 
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receptor by a factoroT3-8 (33). The phosphorylation site is 
contained within the putative receptor binding sequence (Fig. 
5). Two nearby arginine residues, 107 and 109, could serve as 
potential ligands to the phosphothreonine. Modulation of 
receptor affinity might occur as a direct consequence of 
electrostatic interactions with Thr(/ > )-112 or as a result of 
conformational changes induced by phosphorylation. For 
example, reorientation of Arg-109 to form an ion pair with the 
phosphothreonine would sever the Arg-109 salt bridge with 
Glu-57. The activation of glycogen phosphorylase by phos- 
phorylation is triggered by an analogous structural mecha- 
nism (34). 

Basic and acidic FGF are functionally distinguished by 
their interaction with heparin. This heterogeneous, sulfated 
anionic polysaccharide protects both aFGF and bFGF from 
inactivation by low and high temperature and by acid con- 
ditions (35). Heparin potentiates the mitogenic activity of 
aFGF (5). Removal of 40 amino acids from the amino 
terminus does not reduce heparin binding. In contrast, de- 
letion of the carboxyl-terminal 40-42 residues of bFGF, a 
region that is rich in conserved basic residues in both aFGF 
and bFGF, greatly diminishes heparin affinity (36) but not the 
ability of bFGF to stimulate DNA synthesis. Reductive 
methylation of Lys-118 in aFGF (corresponding to Lys-125 in 
bFGF; ref. 37), or replacement of that residue with glutamate 
by site-directed mutagenesis (38) substantially decreases the 
affinity of aFGF for heparin but does not affect its affinity for 
receptors. These data suggest that the third (C) repeat of FGF 
contains the heparin binding site. 

The binding site for anionic heparin sulfate is likely to be 
rich in positively charged lysine and arginine residues. How- 
ever, at least 24 basic residues are exposed on the surface of 
bFGF. Of those conserved between acidic and basic FGF, 5 
residues within the carboxyl-terminal repeat of the molecule 
(nos. 119, 120, 125, 129, and 135) form a prominent cluster. 
The quaternary amino groups of Lys-119, -125, and -129 form 
a nearly equilateral triangle with a mean N f -N f distance of 8 
A. The two ordered sulfate ions in the crystal structure, one 
forming ionic contacts with Lys-119 and Lys-129, and the 
second linking Arg-120 and Lys-125, might mimic heparin 
sulfate moieties (Fig. 5). 

A contour map of the electrostatic potential surface (39) 
calculated at neutral pH shows a radial and asymmetric 
distribution of positive charge perpendicular to the pseudo- 
threefold axis of molecular symmetry (Fig. 6). The most 
prominent center of positive charge corresponds to the group 
of conserved lysine and arginine residues described above. 




Fig. 5. Side chains of bFGF residues discussed in the text: Glu-59 (ion-pair partner of Arg-109); Ser-64 (protein kinase C phosphorylation 
site); Arg-107, Arg-109, Lys-110, Tyr-111, Thr-112, Trp-114, and Tyr-115 (putative receptor-binding loop; Thr-112 (protein kinase A 
phosphorylation site), Lys-119, Arg-120, Lys-125, and Lys-129 (putative heparin binding site). Sulfur atoms of Cys-25, -69, -87, and -93 are shown 
with van der Waals dot surfaces in yellow. Oxygen atoms are shown in red, nitrogen atoms in magenta, carbon atoms in green, and sulfur in 
yellow. The O* backbone is colored cyan. Residues 18 and 143 are labeled. Sulfate ions are shown with van der Waals surfaces in cyan. 
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Fig. 6. Electrostatic potential surface computed using the pro- 
gram Delphi (38) at the solvent-accessible surface of bFGF. A 
coulombic boundary condition was used with inner and outer di- 
electric constants of 2-0 and 80, respectively, an ionic strength of 

0. 145 M in the solvent region, and a Debye length of 8.0 A. The 
negative (red) and positive (blue) potentials are contoured at 2kT. 
The ribbon shown in yellow follows the path of the polypeptide chain 
and the portion of the ribbon in orange (left) marks the position of 
residues 106-116. The view is perpendicular to the molecular three- 
fold axis, similar to that shown in Fig. 5. 

These are Located adjacent to the putative receptor-binding 
loop (residues 106-115). The apparent spatial separation of 
the putative heparin and receptor binding sites is consistent 
with the functional independence of receptor and heparin 
binding (36, 38) and with the results of inhibition studies with 
neutralizing antibodies (40). 

Most of the residues conserved by aFGF and bFGF are 
inaccessible to solvent and participate in the stabilization of 
the hydrophobic core. A conserved but predominantly hy- 
drophobic cluster comprising the carboxyl-terminal triad 
Phe-139/Leu-140/Pro-141, as well as Tyr-104 and Tyr-24-, is 
exposed to solvent but also appears to serve a structural role, 
since deletion of the carboxyl-terminal five residues desta- 
bilizes and partially, inactivates bFGF (36). In general, the 
sites of sequence identity may be more indicative of struc- 
tural rather than functional conservation (41). 

Note, Two reports of the three-dimensional structure of bFGF have 
appeared since this manuscript was submitted: an account of the 
structure of 146-residue bFGF crystallized in space group PI at basic 
pH [ref. 42 (the paper immediately preceding this one)] and a 
description of the structures of analogs of acidic and basic FGF (43). 
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